[FIX] test_hymba#2872

Merged
Qubitium merged 5 commits into main from zx_fix_hymba
May 11, 2026

Conversation

@ZX-ModelCloud
Collaborator

Summary

Fix test_hymba

What Changed

1. `shared_kv_cache_dict` was only populated when `reuse_kv=True`.

That breaks models like Hymba, where not every decoder layer has `reuse_kv=True`: earlier layers may need to publish their KV for later layers even when the current layer itself does not consume `kv_last_layer`. In those cases, `prev_kv` is still empty by the time a later layer actually needs it.

This change adds a model-level `write_shared_kv_cache` switch on `BaseQModel`, keeps the default behavior unchanged, and enables the switch for Hymba.

2. Hymba is now compatible with Transformers v5.
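The cache-publishing change in point 1 can be illustrated with a minimal sketch. The names `reuse_kv`, `prev_kv`, `shared_kv_cache_dict`, and `write_shared_kv_cache` come from the PR description; the layer structure and return values below are hypothetical and simplified, not the actual gptqmodel implementation.

```python
# Hypothetical sketch of the KV-cache publishing logic described above.
# Identifier names follow the PR text; everything else is assumed.

class Layer:
    """Stand-in for a decoder layer with a per-layer reuse_kv flag."""
    def __init__(self, idx: int, reuse_kv: bool):
        self.idx = idx
        self.reuse_kv = reuse_kv

def run_layers(layers, write_shared_kv_cache: bool = False):
    """Walk the layers, returning (layer_idx, prev_kv) for each consumer."""
    shared_kv_cache_dict = {}
    consumed = []
    for layer in layers:
        # A consumer looks up the KV its producer should have published.
        prev_kv = shared_kv_cache_dict.get(layer.idx - 1)
        if layer.reuse_kv:
            consumed.append((layer.idx, prev_kv))
        # Old behavior: publish KV only when this layer has reuse_kv=True.
        # New behavior: the model-level switch makes every layer publish,
        # so a later reuse_kv layer can find its producer's KV.
        if layer.reuse_kv or write_shared_kv_cache:
            shared_kv_cache_dict[layer.idx] = f"kv_from_layer_{layer.idx}"
    return consumed

layers = [Layer(0, reuse_kv=False), Layer(1, reuse_kv=True)]

# Without the switch, layer 0 never publishes, so layer 1 sees nothing:
print(run_layers(layers))                              # [(1, None)]
# With the switch, layer 0 publishes and layer 1 consumes its KV:
print(run_layers(layers, write_shared_kv_cache=True))  # [(1, 'kv_from_layer_0')]
```

This mirrors the Hymba failure mode: the producing layer (layer 0 here) has `reuse_kv=False`, so under the old condition it never wrote into `shared_kv_cache_dict`, and the consuming layer found `prev_kv` empty.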

Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai>
Review comment thread on gptqmodel/utils/hf.py: marked Fixed.
@Qubitium Qubitium merged commit 8cf7ed7 into main May 11, 2026
6 checks passed
@Qubitium Qubitium deleted the zx_fix_hymba branch May 11, 2026 01:22
